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Background 

Bilaterally symmetric nervous systems, such as those found in insects and vertebrates, 
have special midline structures that establish a partition between the two mirror image halves. 
Axons that link the two sides of the nervous system project toward and across the midline, 
forming axon commissures. These commissural axons project toward the midline, at least in 
part, by responding to long-range chemoattractants emanating from the midline. One 
important class of midline chemoattractants are the netrins (Serafini et al., 1994; Kennedy et 
al., 1994), guidance signals whose structure, function, and midline expression is evolutionarily 
conserved from nematodes and fruit flies to vertebrates (Hedgecock et al., 1990; Wadsworth et 
al., 1996; Mitchell et al., 1996; Harris et al., 1996). The attractive actions of netrins appear to 
be mediated by growth cone receptors of the DCC subfamily of the immunoglobulin (Ig) 
superfamily (Keino-Masu et aTj^996TChan et aL7T996f Kolodziej~et al— 1996): 

The midline also provides important short-range guidance signals. This is best 
illustrated by considering the different classes of axon projections in the spinal cord of 
vertebrates or the nerve cord of insects. Although some growth cones extend away from the 
midline, most extend towards or along the midline during some segment of their trajectory. 
Certain classes of growth cones either extend towards the midline or longitudinally along it 
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and yet never cross it. Most growth cones (-90% in the Drosophila CNS), however, do cross 
the midline. After crossing, the majority of these growth cones turn to project longitudinally, 
growing along or near the midline. Interestingly, these axons never cross the midline again, 
despite navigating in the vicinity of other axons that continue to cross. 

What midline signals and growth cone receptors control whether growth cones do or 
do not cross the midline? After crossing once, what mechanism prevents these growth cones 
from crossing again? Studies in the chick (Stoeckli and Landmesser, 1995; Stoeckli et al., 
1997) and grasshopper (Myers and Bastiani, 1993) embryos have led to the suggestion that the 
midline contains a contact-mediated repellent, and that commissural growth cones must 
overcome this repellent to cross the midline. For example, this notion that the midline can be 
repulsive even to growth cones that cross it is supported by time-lapse imaging of the first 
commissural growth cone in the grasshopper embryo. On contacting the midline, this growth 
cone often abruptly retracts, although ultimately it overcomes the repulsion and crosses the 
midline. 

One approach to find the genes encoding the components of such a midline guidance 
system is to screen for mutations in which either too many or too few axons cross the midline. 
Such a large-scale mutant screen was previously conducted in Drosophila and led to the 
identification of two key mutations: commissureless (comm) and roundabout (robo) (Seeger et 
al., 1993; reviewed by Tear et al., 1993). In comm mutant embryos, commissural growth 
cones initially orient toward the midline but then fail to cross it and instead recoil and extend 
on their own side, comm encodes a novel surface protein expressed on midline cells. As 
commissural growth cones contact and traverse the CNS midline, Comm protein is apparently 
transferred from midline cells to commissural axons (Tear et al., 1996). In robo mutant 
embryos, many growth cones that normally extend only on their own side instead now project 
across the midline, and axons that normally cross the midline only once instead appear to 
cross and recross multiple times (Seeger et al, 1993; Kidd et al., 1997). Double mutants of 
comm and robo display a robo-like phenotype. " 

Here we disclose the characterization of robo across animal species, robo encodes a 
new class of guidance receptor with 5 Ig domains, 3 fibronectin (FN) type III domains, a 
transmembrane domain, and a long cytoplasmic domain. Robo defines a new subfamily of Ig 
superfamily proteins that is highly conserved from fruit flies to mammals. The results of 
protein expression and transgenic rescue experiments indicate that Robo functions as the 
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gatekeeper controlling midline crossing and that Robo responds to an unknown midline 
repellent. 

SUMMARY OF THE INVENTION 
The invention provides methods and compositions relating to Robol and Robo2, 
collectively Robo) polypeptides, related nucleic acids, polypeptide domains thereof having 
Robo-specific structure and activity, and modulators of Robo function. Robo polypeptides 
can regulate cell, especially nerve cell, function and morphology. The polypeptides may be ' 
produced recombinantly from transformed host cells from the subject Robo polypeptide 
encoding nucleic acids or purified from mammalian cells. The invention provides isolated 
Robo hybridization probes and primers capable of specifically hybridizing with natural Robo 
genes, Robo-specific binding agents such as specific antibodies, and methods of making and 
using the subject compositions in diagnosis (e.g. genetic hybridization screens for Robo 
transcripts), therapy (e.g. Robo inhibitors to promote nerve cell growth) and in the 
biopharmaceutical industry (e.g. as immunogens, reagents for isolating Robo genes and 
polypeptides, reagents for screening chemical libraries for lead pharmacological agents, etc.). 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 Organization of the roundabout Genomic Locus 

(A) Cosmid chromosome walk through the 58F/59A region of the 2nd chromosome. The 
position of deficiency breakpoints within the cosmids used are shown in the top two rows. 
Identified transcripts from the walk are shown below the cosmids. The 12-1 transcript 
corresponds to the robo gene; the direction of transcription is distal to proximal. The location 
of the 16kb Xbal genomic rescue fragment is indicated below. 

(B) Position and size of introns within the robo transcript. Coding sequence is indicated by 
the thicker part of the line. Introns are represented by gaps. The transcript is shown 3 f -5 f to 
reflect its orientation in (A). 

Figure 2 Structure of Robo Protein 

Schematic of the structure of Drosophila Robo protein. The position of the Immunoglobulin 
(Ig), fibronectin (FN) and transmembrane (TM) domains and the amino acid substitution in 
robo 6 are shown. Percent amino acid identity between Drosophila Robo 1 and Human Robo 1 
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DETAILED DESCRIPTION OF THE INVENTION 


The nucleotide sequences of exemplary natural cDNAs encoding drosophila 1, 
drosophila 2, C. elegans, human 1, human 2 and mouse 1 Robo polypeptides are shown as 
SEQ ID NOS:l, 3, 5, 7, 9 and 11, respectively, and the full conceptual translates are shown as 
SEQ ID NOS:2, 4, 6, 8, 10 and 12. The Robo polypeptides of the invention include 
incomplete translates of SEQ ID NOS:l, 3, 5, 7, 9 and 11 and deletion mutants of SEQ ID 
NOS:2, 4, 6, 8, 10 and 12, which translates and deletion mutants have Robo-specific amino 
acid sequence, binding specificity or function. Preferred translates/deletion mutants comprise 
at least a 6, preferably at least an 8, more preferably at least a 32, most preferably at least a 64 
residue domain of the translates. In a particular embodiment, the deletion mutants comprise 
one or more structural/functional Robo immunoglobulin, fibronectin or cytoplasmic motif 
domains described herein. For example, soluble forms of the disclosed Robo polypeptides 
which comprise one or more Robo IG domains, and especially fusions of two or more Robo 
IG domains, particularly fusions of IG#1 and #2, provide competitive inhibitors of Robo- 
mediated signaling. Exemplary such deletion mutants and recombined deletion mutant ^pu^^i 


fusions include human Robo 1 (SEQ ID NO:8) residues 1-67; 68-167; 168-259; 260-350; 351- 
451; 1-167; 1-259; 1-350; 1-451; 68-259; 1-67 joined to 168-259; and 1-67 joined to 260-451. 

Other deletion mutants provide Robo-specific antigens and/or immunogens, especially 
when coupled to carrier proteins as described below. Generic Robo-specific peptides are 
readily apparent as conserved regions in the aligned Robo polypeptide sequences of Table 1. 

$ Table 1. Sequence Alignment of Robo Family Members: The complete amino acid alignment 
of the predicted Robo proteins encoded by drosophila robo 1 (Dl, SEQ ID NO:2) and Human 
robo 1 (HI, SEQ ID NO:8) are shown. The extracellular domain of C.elegans robo (CE)(SEQ 

ID-NO : 67residues-l-937) ;~S ax~3 rZallen-etal— 1 997)r the-extracellulardomain-of£>ro56>p/z Ha- 
nbo 2 (D2)(SEQ ID NO:4, residues 1-942), and partial sequence of Human robo 2 (H2)(SEQ 
ID NO: 10, residues 1-284) are also aligned. The D2 sequence was predicted by the gene- 
finder program Grail. The position of immunoglobulin domains (Ig), fibronectin domains 
(FN), the transmembrane domain (TM), and conserved cytoplasmic motifs are indicated. The 
extracellular domain of rat robo 1 is nearly identical to HI. 
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mH PMHpENHAIaRSTSTTNNPSrsRSSRMWLlpAWLLLVLVASNGLP 47 Dl 

m. FNRKTLlCTi . 11V1QA vlrsFCEDASNlA 3 0 CE 

mKWKHVPFlVMiSllSlSpNHLFLaQLIPDPEDvErG . NDHGTPIpTSDNDDNSLGYTGS 59 HI 

J 

>IG #1 

AVrGQYQSpriiehpTdlwKknepatlnckVegKpEptiewf kdgepvStn. . EKKshr 105 Dl 

GENpriiehpMdTTvPknDpFtFncQaegNptptiQwfkdgRELKt . . . dTGshr D2 

pViiehpIdVwsRgSpatlncGaK. PStAKiTwykdgQpvItnkEQVNshr 81 CE 

RLrQEDFPpriVehpSdllvskgepatlnckaegRptptiewykGgeRvEtDkDdPRshr 119 HI 

>IG #2 

VQFKDgAlf f YriMQgkkeQ . . dGgEywcvaknRVgQavsrHaslqlavlrddf rvepKd 163 Dl 

iMlpAgGlf f lkvIhSrReS . . dagTywcEakneFgVaRsrnaTlqyavlrdEf rLepAN D2 

iVlDTgslf LlkvNSgkNGKDSdagAyYcvaSneHgeVKsNEGslKLaMlrEdf rvRpRT 141 CE 

MLlpSgslf f IriVhgrkSRP . dEgVyVcvaRnYLgeavsHnaslEvallrddf rQNpSd 178 HI 

trvaKgeTallecgppKglpeptLIwIkdgVplddLKAmSFGASSrVrivdggnlLiSNv 223 Dl 

trvaQgeValmecgAprgSpepQiswrkNgQTlNL VGNKririvdggnlAiQEA ' D2 

vQALGgeMavlecSpprgFpepWswrkdDKElRI . QDmP rYTLHSDgnl I ipPv 195 CE 

vMvaVgePavmecQpprgHpeptiswKkdgSpldd KDEri . TTRggKlMiTYT 23 0 HI 

>IG #3 - 

EPIdEgNyKcIaQnLvgtresSYaKlIvQvkpYfMkepkdqVMLYgQTaTfHcSvggdpP 2 83 Dl 

rQsdDgRyqcyVKnVvgtresATaFlKvHvrpFLIRGpQnqtAVvgSsvVfQcrlggdpL D2 

DRsdSgTyqcvaNnmvgerVsNPaRlSvFekpKf EQepkdMtvDvgAAvLf DcrvTgdpQ 255 CE 

rKsdAgKyVcvGTnmvgeresEVaElTvLerpSf VkRpSnLAvTvDDsaEf KcEARgdpV 2 90 HI 


pKvlwkk . . EEgnlpvsrA RiLHdEKslEiSNItpTdegTyvceaHnNvg 331 

pDvlwrrTASGgnmpLRKFSWLHSASGRVHVl . EdrslkLDDvtLEdmgeytceaDnAvg 

pQITwkr . . KNEPmpvTra YiAKdNrGlRiERvQpSdegeyvcYaRnPAg 3 03 

pTvRwrk— DDgEIipKsr Y E-i-. RddH-Tl-kiRKv-tAGdmgSytcVaEiiMV-g 33_7 

>IG #4 

QiSaRaS 1 IvhappNf TKrpSnKKvGlNgVvQLPcMaSgnpPpSvf wTkegVSTlMf pn . 3 8 8 Dl 
GiTaTGIltvhappKf vlrpKnqLvEIgDEvLf ecQaNgHpRpTLYwsVegNSSllLpGy D2 

TLeasaHlRvqappSfQTkpAdqSvPAggtAtf ecTLVgQpSpaYf wskegQqDllf psy 3 63 CE 

KAeasaTltvqEppHfvVkpRdqVvalgrtvtf QceaTgnpqpalf wRRegsqnllf . sy 396 HI 


Dl 
D2 
CE 

^Hl 
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qlvaQgrtvtf PceTKgnpqpavf wQkegsqnllf pn . H2 

. . .SsHGrQYvAADgtlQitDvrqedegyyv.cSaFSwDssTVrVFlQvSS. .vD 44 0 Dl 

RDGRMEVTLTPEGRSVlSiARFAredSgKVvTcNalnAvgsVSsrTWSvDt . . QF . . . . D2 

VSADGRTK. . vsptgt 1 t iEEvrqVdegAyv . cAGMnSagsslskaAlKvttKAvTGNTP 420 CE 

qpPQsSsrFsvsQtgdltitnvqrsdVgyyi . cqTlnvagsilTkaYlevtd . .vIA. . . 450 HI 

qpQQPNsrCsvsptgdltitnlqrsdAgyyi . cqalTvagsilAkaQlevtd. .vLT. . . H2 

>IG #5 

erpppiiQIgpAnqtlpKgsVaTlpcratgNpSpRiKwFHdgHAvQA.GNRYSi . iqG. . 496 Dl 

eLpppiieqgpvnqtlpvKsIVvlpcrTLgTpvpQVswYLdglpidVqEHERrNLsDA. . D2 

AKpppTieHgHQnqtlMvgsSallpcQaSgKpTpGiswlRdgLpidlTd. . sri . sqHST 477 CE 

drpppViRqgpvnqtVavdgtFvlScVatgSpvpTiLwRkdgVLvSTqd . . sriK. qLeN 507 HI 

drpppiiLqgpAnqtlavdgtaLcKcKatgDpLpViswlkEgFTFPGRd . . PrATiq . eQ H2 

>FN #1 

SslRVDdlq. lsdSgtytciasGeRgeTswAaTltveKpgs . . TSLHraAdpstypAppg 553 Dl 

gAlTiSdlqrHEdEgLytcvasnRNgKsswsGylRLDTptNpNiKf FrapElstypgppg D2 

gslHiAdl . kKPdtgVytciaKneDgestwsaSltveDHtsN . Aqf VrMpdpsNFpsSpT 53 5 CE 

gvlqiR. YAklGdtgRytciasTPsgeatwsaylEvQeFgVp. VqPPrPTdpNLIpsAps 565 HI 

gTlqiKNl . rlsdtgtytcvaTSSsgeaswsaVlDvTeSgAT . i . . SKNYdlsDLpgpps H2 

TpKvLnvsrtsISlRwAKSqEKPGAVgpli . gyTVeyf spdlQTgwIVAaHrvGDtQVti 612 Dl 

kpqMvEKGEnsvtlsw. . . TR SNKVgg S S L Vgy V i e M f GKNE TDg w VAvGT r vQN 1 1 F t Q D2 

QpIIvnvtDtEvElHw. . . NAPS T s g aGp i t gy i i Qy Y sp d 1 gQ TwFN I P D Y vAS t Ey R i 592 CE 

kpEvtdvsrnTvtlsw. . . qpNLNsgaTp . tSyiieaf sHASgSswqtvaENvktEtSAi 621 HI 

kpqvtdvtKnsvtlsw. . . qpGTPGTLpA. SAyiieaf sQSVSNswqtvaNHvkttLytV H2 

>FN #2 

SglTpgtsyVf lvraenTQgisvpsGLsNViktlEA. . . . Df DAASANdlsAarT . llTg 667 Dl 

TglLpgWyFf liraenSHgLsLpsPMsEpitVGTR . . . . Yf NS . . gLdlsEarASllsg D2 

.-kqlkpSHsyMf-Vir.aenEkqig.Tp.syS.sALyttSKPAAQ VAlSDKNKMdMAIaEKRlTsE 652 CE 

kglkpnAiylf IvraAnAYgisDpsqlsDpvktQDV 1 PTS Qg VdHKQVQRE . 1GN 675 HI 

RglRpntiylfMvralnPkV. svT.q H2 

KSvellDasAinAsavrlEwMLHvSADEkyvegLRiHyK. . DaSVPSAQYHSITvMDAsa 725 Dl 

DwelSnasvVDstsMKlTwQI . . . INGkyvegFyVYArQLpNPLNTKyRMLTILNGGGa D2 

QLIKlEEVKTinstavrlFwKKR. . KLEELiDgyyiKWrGPpRTNDNQyVN . . .vTSpsT 707 CE 
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AvLHlHnPTvLSsssIEVHwT. . vDQQSQyiQgyKiLyrPSGaNHGESDWLVFEvRTpAK 733 HI 


>FN #3 

esFwGnlKkytKyef f LTpf . . . f ETiegQpsnskTaltYedvpsappDNIQiGmYn . . 780 Dl 

SsCTiTGlVQytLyef f IVpf . . . YKsVegKpsnsRIaRtledvpsEApYgMEALLln . . D2 

eNYwSnlMPFtnye f f VIpYHSGVHs iHgap snsMDVl t AeAPpsLppEDvRiRmlnL . 766 CE 

NsVviPDlRkGVnyelKARpf . . . f NEFQgaD sE IkFa Kt 1 eEAp s appQgvTVS KNDGN 790 HI 


QtaGWvRwTpppSQHHngNlYgykiEVSAgnTM KVlAnMtLnaTtTsvLlNnltt 835 Dl 

SSaVFLKwkapELKDRHgVlLNyH. vivRglDtAHNFSRIlTnVt IdaASPTLvlAnl tE D2 

. tTLRIswkapKAdGIngllKgFQiviv . gQAPNNNR nltTnERAAsvTlFHlVt 819 CE 

GtalLvswQpppEdTQngMVQEykV. WCLgnEtR YHInKtVdGStFswIPFlVP 844 HI 


gAVy s vr LNS F t KagDgpy s KpISl FMdpTHHVHPpRAHPsGTHDGRHEGqDLTYHNNgN 895 D 1 

gVMyTvGvaaGNnagvgpyCVpATlRldpITKRLDpFINQRDHVND D2 

gMTy KI rvAARSnGgvgv ShgTSEVIMNqDTlEKHL . AAQqENESFLYgL 868 CE 

glRysvEvaaStGagSgvKsEpQFIQldAhgNPVSpEDqVslAQQI 890 HI 

> TM < 

iPPGDIMPTTHKKTTdYlSGpwLMViVCiVlLvlVisAAIsM . vyFkrkhQmTKElGHLS 954 Dl 

vlTqpwFIiiLgAilavlMLs . . f GAMvFVkrkhMm . . MkQsAL D2 

iNK SHVpVIViVaILiIFvViiIAY.CYwRNS.rNSD. . . gkDRSF 909 CE 

SdvVKqp. .AFiagiGAaCWiiLMVf sIwLyRHrkKR. . NglTsTY 932 HI 

WSDNEIT AlniNSKESL. wIDHHRGwRTADTDKD. . 98 8 Dl 

AG I RKVP S FTFTPTVT YQRGGEAVS S GGRPGL 1 n i S EP AAQP wL AD . . TwPNTGNNHNDC 990 HI 


SgLsEsKlLSHVNSSQ. . SnynnS DGGtDyAEvd .... TRNL 1024 Dl 

S I S CCTAGNgNsD sNlTTYSRPADC I AnynnQLDNKQTNLMLPES t VyGDvdLSNKINEM 1050 HI 


CYTOPL"ASMIC~MOTrF-#l 

TtfYNCR KSPDNptpyattMIiGTS sSETCTkT . TSISADkDSGT 1068 Dl 

KtfNSPNLKDGRFWPSGQptpyattQLiQSNLSNNMNNGsGDSGEkHWKPLGQQkQEVA 1110 HI 

HS Py S DAFAGQVPAVp W . . KSNyLqYPVE P 1097 Dl 

PVQyNIVEQNKLNKDYRANDTVPpTIPYNQSyDqNTGGSYNSSDRGSSTSGSQGHKKGAR 117 0 HI 
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CYTOPLASMIC MOTIF #2 

InwSEFlppppEhppp . . . sSTy GyAqGSp 1124 Dl 

TPKVPKQGGMnwADLlppppAhpppHSNsEEyNISVDESyDqEMpCPVPPARMYLQQDEL 12 3 0 HI 

. .eSSRKSSKSAGSglSTNQSILNAsIHsSSSGGFsAWGVSPQYAVAcp 1171 Dl 

EEeEDERGPTPPVRgAASSPAAVSYsHQsTATLTPsPQEELQPMLQDcpEETGHMQHQPD 12 90 HI 

pENVy...sNpl SAVAGGTQNRYQITPTNQHPPQ1 . . . . 1203 Dl 

RRRQPVSPPPPPRPISpPHTyGYIsGplVSDMDTDAPEEEEDEADMEVAKMQTRRlLLRG 13 5 0 HI 

. . . .paY FATTGPGGAVPPNHLP ^ f aTQRHaa 123 0 Dl 

LEQTpaS S VGDLES S VTGSM INGWGS AS EEDNI S SGRS S VS SSDGS FFTDAD f aQAVAaa 1410 HI 

SeyQaglNAar cAQSRACNsCdALATPSPmq 1261 Dl 

Aey .aglKVarRQMQDAAGRRHFHASQcPRPTSPVsTdSNMSAAVmqKTRPAKKLKHQPG 1469 HI 

CYTOPLASMIC MOTIF #3 

ppppvpVpEGWYQPVHPNSH.PMHpTS.SNHQIYQCSSECsDHSRSsQS 1307 Dl 

HLRRETYTDDLppppvpPpAIKSPTAQSKTQLEVRpVWPKLPSMDARTDRsSDRKGsSY 152 9 HI 

HKrQL QLEeHGSSAkQrgGHHRRrA . pWQPCMESeN ENM Dl 

KGrEVLDGRQWDMRTNPGDPREAQeQQNDGkGrgNKAAKrDLpPAKTHLIQeDILPYCRPTF HI 

LAEYEQrQYTsDCCNssrEGDTC SCSeGSCl . . yAe Age PAPRQMTAKKFT 1395 Dl 

PTSNNPrDPSsSSSMssrGSGSRQREQANVGRRNIAeMQVlGGy -eRgeDNNEELEETES 1651 HI 

Exemplary such Robo specific immunogenic and/or antigenic peptides are shown in Table 2. 

Table 2. Immunogenic Robo polypeptides eliciting Robo-specific rabbit polyclonal antibody: 
Robo polyeptide-KLH conjugates immunized per protocol described below. 

^boTPolvf^t^ Immuno genicitv- 

SEQ ID NO:2, residues 68-77 +++ 
SEQ ID NO:2, residues 79-94 +++ 
SEQ ID NO:2, residues 95-103 +++ 
SEQ ID NO:2, residues 122-129 +++ 
SEQ ID NO:2 5 residues 165-176 
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SEQ ID NO:2, residues 181-191 

-H-+ 

SEQ ID NO:2, residues 193-204 

+- H- 

SEQ ID NO:2, residues 244-251 

4— h+ 

SEQ ID NO:2, residues 274-290 

+++ 

SEQ ID NO:2, residues 322-331 


SEQ ID NO:2, residues 339-347 

+++ 

SEQ ID NO:2, residues 407-417 

-H— h 

SEQ ID NO:2, residues 441-451 

-H-+ 

SEQ ID NO:2, residues 453-474 

+++ 

SEQ ID NO:2, residues 502-516 

-H~h 

SEQ ID NO:2, residues 541-553 

+++ 

SEQ ID NO:2, residues 617-629 

+++ 


In addition, species-specific antigenic and/or immunogenic peptides are readily apparent as 
diverged extracellular or cytosolic regions in Table 1 . Exemplary such human specific 
peptides are shown in Table 3. 

Table 3. Immunogenic Robo polypeptides eliciting human Robo-specific rabbit polyclonal 
antibody: Robo polyeptide-KLH conjugates immunized per protocol described below (some 
antibodies show cross-reactivity with corresponding mouse/rat Robo polypeptides). 


Robo Polvpetide. Sequence Immunogenicitv 

SEQ ID NO: 8, residues 1-12 +++ 

SEQ ID NO:8, residues 18-28 +++ 

SEQ ID NO:8, residues 3 1 -40 +++ 

SEQ ID NO:8, residues 45-65 +++ 

SEQ ID NO:8, residues 106-1 16 +++ 

SEQ ID NO:8, residues 137-145 +++ 

SEQ ID NO:8, residues 174-184 +++ 

SEQ ID NO:8, residues 214-230 +++ 

SEQ ID NO:8, residues 274-286 +++ 

SEQ ID NO:8, residues 314-324 - +++ 

SEQ ID NO:8, residues 399-412 +++ 
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SEQ ID NO: 10, residues 222-235 
SEQ ID NO: 10, residues 241-254 


+++ 
+++ 
+++ 
+++ 
+++ 
+++ 


SEQ ID NO:8, residues 496-507 +++ 
SEQ ID NO:8, residues 548-565 
SEQ ID NO:8, residues 599-611 
SEQ ID NO:8, residues 660-671 
SEQ ID NO:8, residues 717-730 
SEQ ID NO:8, residues 780-791 
SEQ ID NO: 8, residues 835-847 
SEQ ID NO:8, residues 877-891 
SEQ ID NO:8, residues 930-942 
SEQ ID NO:8, residues 981-998 
SEQ ID NO:8, residues 1040-1051 +++ 
SEQ ID NO:8, residues 1080-1090 +++ 
SEQ ID NO:8, residues 1154-1168 
SEQ ID NO: 8, residues 1215-1231 
SEQ ID NO:8, residues 1278-1302 +++ 
SEQ ID NO:8, residues 1378-1400 +++ 
SEQ ID NO:8, residues 1460-1469 +++ 


+++ 


+++ 


SEQ ID NO: 8, residues 1497-1519 
SEQ ID NO:8, residues 1606-1626 

SEQ ID NO:8, residues 1639-1651 +++ 

SEQ ID NO:10, residues 5-16 +++ 

SEQ ID NO:10, residues 38-47 +++ 

SEQ ID NO: 1 0, residues 83-94 +++ 

SEQ ID NO: 10, residues 112-125 +++ 

SEQ ID NO: 10, residues 168-180 +++ 

SEQ ID NO: 10, residues 195-209 +++ 


In a particular embodiment, expressed sequence tags EST;yu23dl 1, Accession 
#H77734 and EST;yq76el2, Accession #H52936, as well as peptides conceptually encoded 
thereby, are not within the scope of the present invention (Tables 4 and 5). In a particular 

10 B98-006 



embodiment, the subject Robo polypeptides exclude the corresponding regions of the 
disclosed natural human Robo I polypeptide, i.e. SEQ ID NO:8, residues 168-217 and SEQ ID 
NO:8, residues 1316-1520. 

Table 4 EST:yu23dl 1 sequences compared to H-Robol . yu23dl 1 refers to the fragment of 
DNA which was sequenced. The fragment was sequenced from both ends generating the 
following two sequences: H77734 and H77733. yu23dll is an unspliced cDNA. Only bases 
59-215 match the coding sequence of H-Robol (502-651). The remaining bases are intronic. 
No bases of H77733 match the coding sequence of H-Robol. 

LRDDFRQNPSDVMVAVGEPAVMECQPPRGHPEPTISWKKDGSPLDDKDER H-Robol 
(residues 167-217 of SEQ ID NO:8) 

LRDDFRQKP SDVMVAVGE PAVMECQP PRGHPE PT I S WKKDGS PLDDKDER EST H7 7 734 
(residues 167-217 of SEQ ID NO:8) 

There is an error in the sequence, a T to G change which results in the amino acid N being 
replaced by K. The sequence is shown below and has been reversed for clarity: 

TACTTCGGGATGACTTCAGACAAAAACCTTCGGATGTCATGGTTGCAGTA H-Robol 
(nucleotides 500-549 of SEQ ID NO:7) 

TACTTCGGGATGACTTCAGACAAAACCCTTCGGATGTCATGGTTGCAGTA EST H7 7 734 
(nucleotides 500-549 of SEQ ID NO:7) 

LRDDFRQKPSDVMVAV (residues 167-183 
of SEQ ID NO:8) 


Tabl e 5 EST: yq76el2 sequences compared to H-Robol. yq76el2 refers to the fragment of 
DNA which was sequenced. The fragment was sequenced from both endFgenerating the 
following two sequences: H52936 and H52937 (the latter has been reversed for clarity). The 
sequences can be seen to overlap in the middle. A gap indicates a frameshift error. Note that 
errors only occur in one sequence at any one position. 

GPLVSDMDTDAPEEEEDEADMEVAKMQTRRLLLRGLEQTPASSV H-Robol (residues 


N 
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1315-1359 of SEQ ID NO:8) 

G P L V S DMDTD AP E E E EDE ADME VAKMQ T . RLLLRGLEQTPASSV EST H52 93 6 
(residues 1315-1359 of SEQ ID NO:8) 

GDLESSVTGSMINGWGSASEEDNISSGRSSVSSSDGSFFTDADF H-Robol (residues 
1360-1403 of SEQ ID NO:8) 

GDLESSVTGSMINGWGSASEEDNISSGRSSVSSSDGSFFTDADF EST H52 93 6 
(residues 1360-1403 of SEQ ID NO:8) 

AQAVAAA AEYAGLKVARRQMQDA AGR RHFH AS QC PRPT H-Robol (residues 
1404-1441 of SEQ ID NO:8) 

AQAVAAA AEYAGLKVARRQMQDA AGR RHFH AF QC PRPT EST H52 93 6 (residues 
1404-1441 of SEQ ID NO:8) 

?AAT A? YAGLKVARRQMRDA AGR RHFH AS QC PRPT EST H52 93 7 (residues 
1404-1441 ofSEQIDNO:8) 

SPVSTDSNMSAAVMQKTRPAKKLKHQPGHLRRETYTDDLPPPPV H-Robol (residues 
1442-1485 of SEQ ID NO:8) 

SPVFTDSNM EST H52 93 6 

(residues 1442-1450 of SEQ ID NO:8) 

SPVSTDSNMSAAVMQKTRPAKKLKHQPGHLRRETYTDDLPPPPV EST H52 93 7 
(residues 1442-1485 of SEQ ID NO:8) 

PPPAIKSPTAQSKTQLEVRPVWPKLPSMDARTDK H-Robol (residues 

1486-1520 of SEQ ID NO:8) 

PPPAIKSPTAQSKTQLEVRPVWPKLPSMDARTDK EST H52 93 7 

(residues 1486-1520 of SEQ ID NO:8) 


The subject domains provide Robo domain specific activity or function, such as Robo- 
specific cell, especially neuron modulating or modulating inhibitory activity, Robo-ligand- 
binding or binding inhibitory activity. Robo-specific activity or function may be determined 
by convenient in vitro, cell-based, or in vivo assays: e.g. in vitro binding assays, cell culture 
assays, in animals (e.g. gene therapy, transgenics, etc.), etc. Binding assays encompass any 
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assay where the molecular interaction of a Robo polypeptide with a binding target is 
evaluated. The binding target may be a natural intracellular binding target, a Robo regulating 
protein or other regulator that directly modulates Robo activity or its localization; or non- 
natural binding target such as a specific immune protein such as an antibody, or a Robo 
specific agent such as those identified in screening assays such as described below. Robo- 
binding specificity may be assayed by binding equilibrium constants (usually at least about 
10 7 M\ preferably at least about 10 8 M' 1 , more preferably at least about 10 9 M 1 ), by the ability 
of the subject polypeptide to function as negative mutants in Robo-expressing cells, to elicit 
Robo specific antibody in a heterologous host (e.g a rodent or rabbit), etc. 

The claimed Robo polypeptides are isolated or pure: an "isolated" polypeptide is 
unaccompanied by at least some of the material with which it is associated in its natural state, 
preferably constituting at least about 0.5%, and more preferably at least about 5% by weight of 
the total polypeptide in a given sample and a pure polypeptide constitutes at least about 90%, 
and preferably at least about 99% by weight of the total polypeptide in a given sample. A 
polypeptide, as used herein, is a polymer of amino acids, generally at least 6 residues, 
preferably at least about 10 residues, more preferably at least about 25 residues, most 
preferably at least about 50 residues in length. The Robo polypeptides and polypeptide 
domains may be synthesized, produced by recombinant technology, or purified from 
mammalian, preferably human cells. A wide variety of molecular and biochemical methods 
are available for biochemical synthesis, molecular expression and purification of the subject 
compositions, see e.g. Molecular Cloning, A Laboratory Manual (Sambrook, et ah Cold 
Spring Harbor Laboratory), Current Protocols in Molecular Biology (Eds. Ausubel, et al, 
Greene Publ. Assoc., Wiley-Interscience, NY) or that are otherwise known in the art. 

The invention provides binding agents specific to the claimed Robo polypeptides, 
including natural intracellular binding targets, etc., methods of identifying and making such 
agents, and their use in diagnosis, therapy and pharmaceutical development. For example, 
specificbindingagentsare usefuHnavarietyof diagnosticandtherapeutic applications; 
especially where pathology, wound repair incompetency or prognosis is associated with 
improper or undesirable axon outgrowth, orientation or inhibition thereof. Novel Robo- 
specific binding agents include Robo-specific receptors, such as somatically recombined 
polypeptide receptors like specific antibodies or T-cell antigen receptors (see, e.g Harlow and 
Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory), natural 
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intracellular binding agents identified with assays such as one-, two- and three-hybrid screens, 
non-natural intracellular binding agents identified in screens of chemical libraries such as 
described below, etc. Agents of particular interest modulate Robo function. 

In a particular embodiment, the subject polypeptides are used to generate Robo- or 
human Robo-specific antibodies. For example, the Robo- and human Robo-specific peptides 
described above are covalently coupled to keyhole limpet antigen (KLH) and the conjugate is 
emulsified in Freunds complete adjuvant. Laboratory rabbits are immunized according to 
conventional protocol and bled. The presence of Robo-specific antibodies is assayed by solid 
phase immunosorbant assays using immobilized Robo polypeptides of SEQ ID NO:2, 4, 6, 8, 
10 or 12. Human Robo-specific antibodies are characterized as uncross-reactive with non- 
human Robo polypeptides (SEQ ID NOS:2, 4, 6 and 12). 

Accordingly, the invention provides methods for modulating cell function comprising 
the step of modulating Robo activity, e.g. by contacting the cell with a Robo inhibitor, e.g. 
inhibitory Robo deletion mutants, Robo-specific antibodies, etc. (supra). The target cell may 
reside in culture or in situ, i.e. within the natural host. The inhibitor may be provided in any 
convenient way, including by (i) intracellular expression from a recombinant nucleic acid or 
(ii) exogenous contacting of the cell. For many in situ applications, the compositions are 
added to a retained physiological fluid such as blood or synovial fluid. For CNS 
administration, a variety of techniques are available for promoting transfer of the therapeutic 
across the blood brain barrier including disruption by surgery or injection, drugs which 
transiently open adhesion contact between CNS vasculature endothelial cells, and compounds 
which facilitate translocation through such cells. Robo polypeptide inhibitors may also be 
amenable to direct injection or infusion, topical, intratracheal/nasal administration e.g. through 
aerosol, intraocularly, or within/on implants e.g. fibers e.g. collagen, osmotic pumps, grafts 
comprising appropriately transformed cells, etc. A particular method of administration 
involves coating, embedding or derivatizing fibers, such as collagen fibers, protein polymers, 

ete-with-therapeutie proteins— Other-useful-approaehes-are deseribed-in Gtto-et-al-(-1989) J 

Neuroscience Research 22, 83-91 and Otto and Unsicker (1990) J Neuroscience 10, 191 2- 
1921. Generally, the amount administered will be empirically determined, typically in the 
range of about 10 to 1000 Jig/kg of the recipient and the concentration will generally be in the 
range of about 50 to 500 \ig/m\ in the dose administered. Other additives may be included, 
such as stabilizers, bactericides, etc. will be present in conventional amounts. For diagnostic 
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uses, the inhibitors or other Robo binding agents are frequently labeled, such as with 
fluorescent, radioactive, chemiluminescent, or other easily detectable molecules, either 
conjugated directly to the binding agent or conjugated to a probe specific for the binding 
agent. 

The amino acid sequences of the disclosed Robo polypeptides are used to back- 
translate Robo polypeptide-encoding nucleic acids optimized for selected expression systems 
(Holler et al. (1993) Gene 136, 323-328; Martin et al. (1995) Gene 154, 150-166) or used to 
generate degenerate oligonucleotide primers and probes for use in the isolation of natural 
Robo-encoding nucleic acid sequences ("GCG" software, Genetics Computer Group, Inc, 
Madison WI). Robo-encoding nucleic acids used in Robo-expression vectors and 
incorporated into recombinant host cells, e.g. for expression and screening, transgenic 
animals, e.g. for functional studies such as the efficacy of candidate drugs for disease 
associated with Robo-modulated cell function, etc. 

The invention also provides nucleic acid hybridization probes (Tables 6, 7) and 
replication / amplification primers (Tables 7, 8) having a Robo cDNA specific sequence 
comprising SEQ ID NO:l, 3, 5, 7, 9 or 1 1 and sufficient to effect specific hybridization 
thereto (i.e. specifically hybridize with SEQ ID NO:l, 3, 5, 7, 9 or 11, respectively, in the 
presence of CDO cDNA. 

Table 5., Hybridisation Probes for Human Roundabout 1 
Immunoglobulin Domain #1 

CCACCTCGCATTGTTGAACACCCTTCAGACCTGATTGTCTCAAAAGGAGAACCTGCAACTTTGAACTGCAAAGCT 
GAAGGCCGCCCCACACCCACTATTGAATGGTACAAAGGGGGAGAGAGAGTGGAGACAGACAAAGATGACCCTCGC 
TCACACCGAATGTTGCTGCCGAGTGGATCTTTATTTTTCTTACGTATAGTACATGGACGGAAAAGTAGACCTGAT 
GAAGGAGTCTATGTCTGTGTAGCAAGGAATTACCTTGGAGAGGCTGTGAGCCACAATGCATCGCTGGAAGTAGCC 

ata (nucleotides 134-501 ofSEQIDNO:7) 
Immunoglobulin Domain#2 

CTTCGGGATGACTTCAGACAAAACCCTTCGGATGTCATGGTTGCAGTAGGAGAGCCTGCAGTAATGGAATGCCAA 
CCTCCACGAGGCCATCCTGAGCCCACCATTTCATGGAAGAAAGATGGCTCTCCACTGGATGATAAAGATGAAAGA 
ATAACTATACGAGGAGGAAAGCTCATGATCACTTACACCCGTAAAAGTGACGCTGGCAAATATGTTTGTGTTGGT 

ACCAATATGGTTGGGGAACGTGAGAGTGAAGTAGCCGAGCTGACTGTCTT (nucleotides 502-776 of SEQ 

ID NO:7) 
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Immunoglobulin Domain #3 

AGAGAGACCATCATTTGTGAAGAGACCCAGTAACTTGGCAGTAACTGTGGATGACAGTGCAGAATTTAAATGTGA 
GGCCCGAGGTGACCCTGTACCTACAGTACGATGGAGGAAAGATGATGGAGAGCTGCCCAAATCCAGATATGAAAT 
CCGAGATGATCATACCTTGAAAATTAGGAAGGTGACAGCTGGTGACATGGGTTCATACACTTGTGTTGCAGAAAA 

TATGGTGGGCAAAGCTGAAGCATCTGCTACTCTGACTGTTCAAGAACC (nucleotides 777- 1 049 of SEQ 

ID NO:7) 

Immunoglobulin Domain #4 

CCACATTTTGTTGTGAAACCCCGTGACCAGGTTGTTGCTTTGGGACGGACTGT7U\CTTTTCAGTGTGAAGCAACC 
GGAAATCCTCAACCAGCTATTTTCTGGAGGAGAGAAGGGAGTCAGAATCTACTTTTCTCATATCAACCACCACAG 
TCATCCAGCCGATTTTCAGTCTCCCAGACTGGCGACCTCACAATTACTAATGTCCAGCGATCTGATGTTGGTTAT 
TACATCTGCCAGACTTTAAATGTTGCTGGAAGCATCATCACAAAGGCATATTTGGAAGTTACAGATGTGATTGCA 

(nucleotides 1051-1350 of SEQ ID NO:7) 
Immunoglobulin Domain #5 

GATCGGCCTCCCCCAGTTATTCGACAAGGTCCTGTGAATCAGACTGTAGCCGTGGATGGCACTTTCGTCCTCAGC 
TGTGTGGCCACAGGCAGTCCAGTGCCCACCATTCTGTGGAGAAAGGATGGAGTCCTCGTTTCAACCCAAGACTCT 
CGAATCAAACAGTTGGAGAATGGAGTACTGCAGATCCGATATGCTAAGCTGGGTGATACTGGTCGGTACACCTGC 
ATTGCATCAACCCCCAGTGGTGAAGCAACATGGAGTGCTTACATTGAAGTTCAAGAATTTG (nucleotides 

1351-1636 of SEQ ID NO:7) 
Fibronectin Domain #1 

GAGTTCCAGTTCAGCCTCCAAGACCTACTGACCCAAATTTAATCCCTAGTGCCCCATCAAAACCTGAAGTGACAG 
ATGTCAGCAGAAATACAGTCACATTATCGTGGCAACCAAATTTGAATTCAGGAGCAACTCCAACATCTTATATTA 
TAGAAGCCTTCAGCCATGCATCTGGTAGCAGCTGGCAGACCGTAGCAGAGAATGTGAAAACAGAAACATCTGCCA 
TTAAAGGACTCAAACCTAATGCAATTTACCTTTTCCTTGTGAGGGCAGCTAATGCATATGGAATTAGTGATC 

(nucleotides 1637-1933 of SEQ IDNO:7) 


Fibronectin Domain #2 

CAAGCCAAATATCAGATCCAGTGAAAACACAAGATGTCCTACCAACAAGTCAGGGGGTGGACCACAAGCAGGTCC 
AGAGAGAGCTGGGAAATGCTGTTCTGCACCTCCACAACCCCACCGTCCTTTCTTCCTCTTCCATCGAAGTGCACT 
GGACAGTAGATCAACAGTCTCAGTATATACAAGGATATAAAATTCTCTATCGGCCATCTGGAGCCAACCACGGAG 
AATCAGACTGGTTAGTTTTTGAAGTGAGGACGCCAGCCAAAAACAGTGTGGTAATCCCTGATCTCAGAAAGGGAG. 
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TCAACTATGAAATTAAGGCTCGCCCTTTTTTTAATGAATTTCAAGGAGCAG (nucleotides 1934-2284 of 

SEQ ID NO:7) 
Fibronectin Domain #3 

ATAGTGAAATCAAGTTTGCCAAAACCCTGGAAGAAGCACCCAGTGCCCCACCCCAAGGTGTAACTGTATCCAAGA 
ATGATGGAAACGGAACTGCAATTCTAGTTAGTTGGCAGCCACCTCCAGAAGACACTCAAAATGGAATGGTCCAAG 
AGTATAAGGTTTGGTGTCTGGGCAATGAAACTCGATACCACATCAACAAAACAGTGGATGGTTCCACCTTTTCCG 
TGGTCATTCCCTTTCTTGTTCCTGGAATCCGATACAGTGTGGAAGTGGCAGCCAGCACTGGGGCTGGGTCTGGGG 

taaag (nucleotides 2285-2589 of SEQ ID NO:7) 
Transmembrane Domain 

agatttcagatgtggtgaagcagccggccttcatagcaggtattggagcagcctgttggatcatcctcatggtct 
tcagcatctggctttatcgacaccg (nucleotides 2666-2765 of SEQ ID NO:7) 

Cytoplasmic Motif #1 

aatctgaaggatgggcgttttgtcaatccatcagggcagcctactccttacgccaccactcagctcatccagtca 
aacctcagcaacaacatgaacaatg (nucleotides 3169-3268 of SEQ ID NO:7) 

Cytoplasmic Motif #2 

cccaaggtaccaaaacagggtggcatgaactgggcagacctgcttcctcctcccccagcacatcctcctccacac 
agcaatagcgaagagtacaacattt (nucleotides 3514-3613 of SEQ ID NO:7) 

Cytoplasmic Motif #3 

ccagccaggacatctgcgcagagaaacctacacagatgatcttccaccacctcctgtgccgccacctgctataaa 
gtcacctactgcccaatccaagaca (nucleotides 4398-4497 of SEQ ID NO:7) 

Table 6. Hybridisation Probes for Human Roundabout 2 

Immunoglobulin-Domain #4 „ 

cagattgttgctcaaggtcgaacagtgacatttccctgtgaaactaaaggaaacccacagccagctgttttttgg 
c ag aaag aagg c ag c c agaac c t ac t t t t c c c aaac c aac c c c ag c agc cc aac ag tag at g c t c ag t g t c ac c a 
actggagacctcacaatcacc/^acattcaacgttccgacgcgggttactacatctgccaggctttaactgtggca 

GGAAGCATTTTAGCAAAAGCTCAACTGGAGGTTACTGATGTTTTGACA (nucleotides 1-273 of SEQ ID 

NO:9) 
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Immunoglobulin Domain #5 

GATAGACCTCCACCTAT7\ATTCTACAAGGCCCAGCCAACCAAACGCTGGCAGTGGATGGTACAGCGTTACTGAAA 
TGTAAAGCCACTGGTGATCCTCTTCCTGTAATTAGCTGGTTAAAGGAGGGATTTACTTTTCCGGGTAGAGATCCA 
AGAGCAACAATTCAAGAGCAAGGCACACTGCAGATTAAGAATTTACGGATTTCTGATACTGGCACTTATACTTGT 
GTGGCTACAAGTTCAAGTGGAGAGGCTTCCTGGAGTGCAGTGCTGGATGTGACAGAGTCT (nucleotides 274- 

558 ofSEQ ID NO:9) 
Fibronectin Domain #1 

GGAGCAACAATCAGTAAAAACTATGATTTAAGTGACCTGCCAGGGCCACCATCCAAACCGCAAGTCACTGATGTT 
ACTAAGAACAGTGTCACCTTGTCCTGGCAGCCAGGTACCCCTGGAACCCTTCCAGCAAGTGCATATATCATTGAG 
GCTTTCAGCCAATCAGTGAGCAACAGCTGGCAGACCGTGGC7^AACCATGTA7\AGACCACCCTCTATACTGTAAGA 
GGACTGCGGCCCAATACAATCTACTTATTCATGGTCAGAGCGATCAACCCCAAGGTYTCAGTGACCCAAGT 

(nucleotides 559-854 of SEQ ID NO:9) 

Table 7. Primer Pairs for PCR of Human Roundabout 1 Domains 
Immunoglobulin Domain #1 

Forward : 5 ' CCACCTCGCATTGTTGAACACCCTTCAGAC 3 ' (nucleotides 1 99-228 of SEQ ID 

NO:7) 

Reverse : 5 1 atggctacttccagcgatgcattgtggctc 3 ' (reverse complement of nucleotides 
471-500 of SEQ ID NO:7) 

Immunoglobulin Domain #2 

Forward : 5 ■ CTTCGGGATGACTTCAGACAAAACCCTTCG 3 1 (nucleotides 502-533 of SEQ ID 

NO:7) 

Reverse : 5 • taagacagtcagctcggctacttcactctc 3 ■ (reverse complement of nucleotides 
751-777 ofSEQIDNO:7) 

Immunoglobulin Domain #3 

Forward: 5 ' AGAGAGACCATCATTTGTGAAGAGACCCAG 

NO:7) 

Reverse : 5 ' AGGTTCTTGAACAGTCAGAGTAGCAGATGC 

1021-1050 of SEQ ID NO:7) 
Immunoglobulin Domain #4 


-3- (nucleotides-7-77-806-of-SEQ-ID- 

3 ' (reverse complement of nucleotides 
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Forward : 5 ' CCACATTTTGTTGTGAAACCCCGTGACCAG 3 • (nucleotides 105 1-1080 of SEQ ED 

NO:7) 

Reverse : 5 ' tgcaatcacatctgtaacttccaaatatgc 3 1 (reverse complement of nucleotides 
1321-1350 of SEQ ID NO:7) 

Immunoglobulin Domain #5 

Forward : 5 1 ATCGGCCTCCCCCAGTTATTCGACAAGGTC 

NO:7) 

Reverse : 5 1 CAAATTCTTGAACTTCAATGTAAGCACTCC 

1607-1636 of SEQIDNO:7) 
Fibronectin Domain #1 

Forward : 5 ' GAGTTCCAGTTCAGCCTCCAAGACCTACTG 3 ' (nucleotides 1637-1666 of SEQ ID 

NO:7) 

Reverse : 5 ' tcactaattccatatgcattagctgccctc 3 ' (reverse complement of nucleotides 
1902-1931 of SEQIDNO:7) 

Fibronectin Domain #2 

Forward : 5 * CAAGCCAAATATCAGATCCAGTGAAAACAC 3 ' (nucleotides 1934-1963 of SEQ ID 

NO:7) 

Reverse : 5 1 atctgctccttgaaattcattaaaaaaagg 3 ' (reverse complement of nucleotides 
2257-2286 of SEQ ID NO:7) 

Fibronectin Domain #3 

Forward : 5 ■ ATAGTGAAATCAAGTTTGCCAAAACCCTG 3 * (nucleotides 2285-23 13 of SEQ ID 

NO:7) 

.Reverse : 5 1 ctctttaccccagacc cagccccagtgctg 3 • (reverse complement of nucleotides 
2561-2591 ofSEQIDNO:7) 

Transmembrane Domain 

Forward: 5 • ggaccaagtcagcctcgctcagcagatttc 3 1 (nucleotides 2643-2672 of SEQ ID 
NO:7) " ~ - 


3 ■ (nucleotides 1352-1381 of SEQ ID 
3 ■ (reverse complement of nucleotides 
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Reverse : 5 1 actagtaagtccgtttctcttcttgcggtg 3 • (reverse complement of nucleotides 
2761-2790 of SEQ ID NO:7) 

Cytoplasmic Motif #1 

Forward : 5 1 CTGAAGGATGGGCGTTTTGTCAATCCATC 3 ' (nucleotides 3 1 72-3200 of SEQ ID 

NO:7) 

Reverse : 5 1 gtcccagtggtttccagtgcttctcgccag 3 ' (reverse complement of nucleotides 
3281-3310 of SEQ ID NO:7) 

Cytoplasmic Motif #2 

Forward: 5 ' GGCACAAGAAAGGGGCAAGAACACCCAAGG 

NO:7) 

Reverse: 5 ' ATAGCTTTCATCTACAGAAATGTTGTACTC 

3601-3630 of SEQ ID NO:7) 
Cytoplasmic Motif #3 

Forward : 5 ' AC CAGACCAGC C AAGAAACTGAAACAC C AG 

NO:7) 

Reverse : 5 ' GTACTTCCAGCTGTGTCTTGGATTGGGCAG 

4481.-4510 of SEQ ID NO:7) 

Table 8. Human Roundabout 2 Primer Pairs 
Immunoglobulin Domain #4 

Forward: 5 » GTTGCTCAAGGTCGAACAGTGACATTTCCC 
Reverse: 5 ' TGTCAAAACATCAGTAACCTCCAGTTGAGC 

244-237 of SEQ ID NO:9) 


Immunoglobulin Domain #5 

Forward: 5 1 GATAGACCTCCACCTATAATTCTACAAGGC 3 '(nucleotides 274-303 of SEQ ID 

NO:9) 

Reverse : 5 • " gactctgtcacatcgagcactgcactccag 3 1 (reverse complement of nucleotides 
528-557 of SEQ ID NO:9) 
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3 • (nucleotides 3491-3520 of SEQ ID 
3 ' (reverse complement of nucleotides 


3 ■ (nucleotides 4372-4401 of SEQ ID 
3 * (reverse complement of nucleotides 


3 ' (nucleotides 7-36 of SEQ ID NO:9) 
3 1 (reverse complement of nucleotides 



Fibronectin Domain #1 

Forward : 5 ' CAATCAGTAAAAACTATGATTTAAGTG 3 ' (nucleotides 566-592 of SEQ ID NO:9) 

Reverse : 5 ■ tcgctctgaccatgaataagtagattg 3 1 (reverse complement of nucleotides 800- 
826 of SEQIDNO:9) 

Such primers or probes are at least 12, preferably at least 24, more preferably at least 36 and 
most preferably at least 96 bases in length. Demonstrating specific hybridization generally 
requires stringent conditions, for example, hybridizing in a buffer comprising 30% formamide 
in 5 x SSPE (0.18 M NaCl, 0.01 M NaP0 4 , pH7.7, 0.001 M EDTA) buffer at a temperature of 
42*C and remaining bound when subject to washing at 42°C with 0.2 x SSPE; preferably 
hybridizing in a buffer comprising 50% formamide in 5 x SSPE buffer at a temperature of 
42°C and remaining bound when subject to washing at 42°C with 0.2 x SSPE buffer at 42°C. 
Robo nucleic acids can also be distinguished using alignment algorithms, such as BLASTX 
(Altschul et al (1990) Basic Local Alignment Search Tool, J Mol Biol 215, 403-410). 

The subject nucleic acids are of synthetic/non-natural sequences and/or are isolated, 
i.e. unaccompanied by at least some of the material with which it is associated in its natural 
state, preferably constituting at least about 0.5%, preferably at least about 5% by weight of 
total nucleic acid present in a given fraction, and usually recombinant, meaning they comprise 
a non-natural sequence or a natural sequence joined to nucleotide(s) other than that which it is 
joinedjo on a natural chromosome. The subject recombinant nucleic acids comprising the , 
nucleotide sequence of SEQ ID NO:l 5 3, 5, 7, 9 or 1 1, or fragments thereof, contain such 
sequence or fragment at a terminus, immediately flanked by (i.e. contiguous with) a sequence 
other than that which it is joined to on a natural chromosome, or flanked by a native flanking 
region fewer than 10 kb, preferably fewer than 2 kb, more preferably fewer than 500 bp, which 
is at a terminus or is immediately flanked by a sequence other than that which it is joined to on 
a natural chromosome. While the nucleic acids are usually RNA or DNA, it is often 
advantageous to use nucleic acids comprising other bases or nucleotide analogs to provide" 
modified stability, etc. 

In a particular embodiment, expressed sequence tags EST;yu23dl 1, Accession 
#H77734 and EST;yq76el2, Accession #H52936, and deletion mutants thereof, are not within 
the scope of the present invention. In another embodiment, the subject Robo nucleic acids 
exclude the corresponding regions of the disclosed natural human Robo I nucleic acids, i.e. 
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SEQ ID NO:7, nucleotides 502-651 and SEQ ID NO:7, nucleotides 3946-4560. 

Table 10. Exemplary differences between H52936 and corresponding human Robo I 
sequences. 

(1) At position 86, there is a T instead of an A. The new codon therefore reads TGA (Stop) 
instead of AGA (R). 

(2) There is a missing G at position 286-7, causing a frameshift. 

(3) There is an extra G at position 334, causing a frameshift. 

(4) There is an extra T at position 344, causing a frameshift. 

(5) There is an extra N at position 357, causing a frameshift. 

(6) There is a T instead of a C at 362. The new codon reads TTT (F) instead of TCT (S). 

(7) There is an extra T at position 364, causing a frameshift. 

(8) There is an extra N at position 370, causing a frameshift and a changed amino acid (the 
codon TTN is ambiguous). 

(9) There are two Ts at position 394 and 395 instead of a C, causing a frameshift and amino 
acid changes. 

Table 1 1 . Exemplary differences between H52937 (reverse sequence) and corresponding 
human Robo I sequences. 

(1) There are multiple errors in the first 30 bases. 

(2) At position 63, a G replaces an A. The new codon CGG codes for R instead of CAG for Q. 

(3) The EST ends by joining to part of the human glycophorin B gene (353-442) 

The subject nucleic acids find a wide variety of applications including use as 
translatable transcripts, hybridization probes, PCR primers, diagnostic nucleic acids, etc.; use 
in detecting the presence of Robo genes and gene transcripts and in detecting or amplifying 
nucleic acids encoding additional Robo^homologs andTtfucturalla^^ 

hybridization probes find use in identifying wild-type and mutant Robo alleles in clinical and 
laboratory samples. Mutant alleles are used to generate allele-specific oligonucleotide (ASO) 
probes for high-throughput clinical diagnoses. In therapy, therapeutic Robo nucleic acids are 
used to modulate cellular expression or intracellular concentration or availability of active 
Robo. 
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The invention provides efficient methods of identifying agents, compounds or lead 
compounds for agents active at the level of a Robo modulatable cellular function. Generally, 
these screening methods involve assaying for compounds which modulate 
Robo interaction with a natural Robo binding target. A wide variety of assays for binding 
agents are provided including labeled in vitro protein-protein binding assays, immunoassays, 
cell based assays, etc. The methods are amenable to automated, cost-effective high throughput 
screening of chemical libraries for lead compounds. Identified reagents find use in the 
pharmaceutical industries for animal and human trials; for example, the reagents may be 
derivatized and rescreened in in vitro and in vivo assays to optimize activity and minimize 
toxicity for pharmaceutical development. 

Cell and animal based neural guidance/repulsion assays are described in detail in the 
experimental section below. In vitro binding assays employ a mixture of components 
including a Robo polypeptide, which may be part of a fusion product with another peptide or 
polypeptide, e.g. a tag for detection or anchoring, etc. The assay mixtures comprise a natural 
intracellular Robo binding target. While native full-length binding targets may be used, it is 
frequently preferred to use portions (e.g. peptides) thereof so long as the portion provides 
binding affinity and avidity to the subject Robo polypeptide conveniently measurable in the 
assay. The assay mixture also comprises a candidate pharmacological agent. Candidate agents 
encompass numerous chemical classes, though typically they are organic compounds; 
preferably small organic compounds and are obtained from a wide variety of sources including 
libraries of synthetic or natural compounds. A variety of other reagents may also be included 
in the mixture. These include reagents like salts, buffers, neutral proteins, e.g. albumin, 
detergents, protease inhibitors, nuclease inhibitors, antimicrobial agents, etc. may be used. 

The resultant mixture is incubated under conditions whereby, but for the presence of 
the candidate pharmacological agent, the Robo polypeptide specifically binds the cellular 
binding target, portion or analog with a reference binding affinity. The mixture components 

can be added in any order that provides for the requisite~bindings and~incubations may be 

performed at any temperature which facilitates optimal binding. Incubation periods are 
likewise selected for optimal binding but also minimized to facilitate rapid, high-throughput 
screening. 

After incubation, the agent-biased binding-between the Robo polypeptide and one or 
more binding targets is detected by any convenient way. Where at least one of the Robo or 
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binding target polypeptide comprises a label, the label may provide for direct detection as 
radioactivity, luminescence, optical or electron density, etc. or indirect detection such as an 
epitope tag, etc. A variety of methods may be used to detect the label depending on the 
nature of the label and other assay components, e.g. through optical or electron density, 
radiative emissions, nonradiative energy transfers, etc. or indirectly detected with antibody 
conjugates, etc. 

A difference in the binding affinity of the Robo polypeptide to the target in the absence 
of the agent as compared with the binding affinity in the presence of the agent indicates that 
the agent modulates the binding of the Robo polypeptide to the Robo binding target. For 
example, in the cell-based assay also described below, a difference in Robo-dependent 
modulation of axon outgrowth or orientation in the presence and absence of an agent indicates 
the agent modulates Robo function. A difference, as used herein, is statistically significant 
and preferably represents at least a 50%, more preferably at least a 90% difference. 

The following experimental section and examples are offered by way of illustration 
and not by way of limitation. 

EXPERIMENTAL 

Cloning of the roundabout Gene. The robo 1 allele was mapped to the plexus-brown 
interval on the right arm of the second chromosome by recombination mapping; the numbers 
of recombinants suggested a map position very close to plexus at 58F/59A. One deficiency 
[Df(2R)P, which deletes 58E3/FT through 60D14/E2] fails to complement robo mutations, 
two other deficiencies [Df(2R)59AB and Df(2R)59AD, which delete 59A1/3 through 59B1/2 
and 59A1/3 through 59D1/4 respectively] do complement robo, and a duplication - 
[Dp(2;Y)bw + Y, which duplicates 58F1/59A2 through 60E3/F1] rescues robo mutations. This 
mapping places robo in the 58F/59A region. 

We initiated chromosomal walks from PI clones mapped to the region, beginning from 

the distal side using clone DS02204 and from the proximaHiideusin^ 

used cosmid clones (Tamkun et al., 1992) to complete a walk of -150 kb. We then looked for 
RFLPs in the recombinants between the multiple marked chromosome and the robo mutant 
chromosome. A 6.8kb EcoRI fragment from cosmid 106-5 identified a Hindll RFLP on the 
mapping chromosome that was present on a single robo mutant recombinant line. This 
fragment identified a proximal limit for the location of robo. Further deficiencies in this 
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region were then tested (Kerrebrock et al., 1995). Of these deficiencies, Df(2R)X58-5 and 
Df(2R)X58-12 remove robo while Df(2R)X58-l does not. Df(2R)X58-12 fails to complement 
Df(2R)59AB yet complements Df(2R)59AD indicating that Df(2R)59AB extends further 
proximal; this proximal endpoint provides a distal limit for the location of robo. Probes from 
the walk were used to identify the breakpoints of these deficiencies (Figure 1 A). Df(2R)X58-l 
breaks in a 9.6 kb EcoRI/BamHI fragment within cosmid GJ12 5 whereas Df(2R) 59AB breaks 
in a 8 kb BamHI/EcoRI fragment within cosmid 106-1435. This reduces the location of robo 
to a 75 kb region bounded by these restriction fragments. Hybridization of 0-16 hr poly-A + 
embryonic Northern blots with cosmids GJ12, 106-12, and 106-1435 revealed at least five 
transcripts. Reverse Northern mapping identified the regions containing these transcripts 
(Figure 1 A). These regions were used as probes to isolate cDNAs. Seven different cDNAs 
were isolated and analyzed by in situ hybridization. The expression pattern of five of these 
transcripts allowed us to tentatively discount them as encoding for robo since they were not 
expressed in the embryonic CNS at the appropriate stage. Of the two cDNAs remaining, 12-1 
appeared by its size and expression the most likely candidate for robo. A 16 kb Xbal 
fragment including the 12-1 transcript and a region 5' to the transcript is capable of rescuing 
the robo mutant. 

roundabout Encodes a Member of the Immunoglobulin Superfamily. We recovered 
and sequenced overlapping cDNA clones corresponding to the 12-1 transcription unit. A 
single long open reading frame (ORF) that encodes 1395 amino acids was identified (Dl in. 
Table 1). Conceptual translation of the ORF reveals the Robo protein to be a member of the 
Ig superfamily; Robo f s ectodomain contains five immunoglobulin (Ig)-like repeats followed 
by three fibronectin (Fn) type-Ill repeats. The predicted ORF also contains a transmembrane 
domain and a large 457 amino acid (a.a.) cytoplasmic domain. Hydropathy analysis of the 
Robo sequence indicates a single membrane spanning domain of 25 a.a. (Kyte and Doolittle, 
1982) plus a signal sequence with a predicted cleavage site between G51 and Q52 (Nielsen et 
al 1997). ~~ " 

We identify the 12-1 transcript as encoding robo based on several criteria. First, the 
embryonic robo phenotype can be rescued by the 16 kb Xbal genomic fragment containing 
this cDNA; no other transcripts are contained in this 16 kb Xbal fragment. Second, we 
identified a Cfol RFLP associated with the allele robo 6 . This polymorphism is due to a 
change of nucleotide 332 of the ORF from G to A, which results in a change of Gly m to Asp. 
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Gly 1 1 1 is in the first Ig domain (Figure 2), and is conserved in all Robo homologues 
identified. The change is specific to the allele robo 6 and is not seen in the parental 
chromosome or in any of the other seven alleles, all of which were generated from the same 
parental genotype. Third, the production of antibodies (below) which recognize the Robo 
protein reveals that the alleles robo 1 , robo 2 , robo 3 , robo 4 and robo 5 do not produce Robo 
protein (Table 12). 

Table 12. robo Mutant Alleles 


Allele Synonym Class 

robo 1 GA285 Protein null 

robo 2 GA1112 Protein null 

robo 3 Z14 Protein null 

robo 4 Z570 Protein null 

robo 5 Z1772 Protein null 

robo 6 Z1757 Protein positive; Gly nl to Asp 

robo 7 Z2 1 30 Reduced protein levels 

robo 8 Z3 1 27 Protein positive 


All alleles were generated by EMS mutagenesis of FasIII null chromosomes. Each of these 
alleles appear to represent a complete, or near complete, loss-of- function phenotype for robo, 
since the mutant phenotype observed when these alleles are placed over a chromosome 
deficient for the robo locus [Df(2R) X58-5] is indistinguishable from the homozygous allele. 

Finally, transgenic neural expression of robo rescues the midline crossing phenotype of robo 
mutants (see below). 

Developmental Northern blot analysis using both cDNA and genomic probes suggests 
that robo is encoded by a single transcript of -7500 bp. We sequenced genomic DNA and 
identified 17 introns within the sequence of which 14 are only 50-75 bp in length plus three 
intronsjof843bp^3_6_bp , and 1 10 bp (Figure IB). The precise start point of the transcript has 
not been determined. 

A Family of Evolutionarily Conserved Robo-like Proteins. The presence of five Ig and 
three Fn domains, a transmembrane domain, and a long (452 a.a.) cytoplasmic region indicates 
that Robo may be a receptor and signaling molecule. The netrin receptor 
DCC/Frazzled/UNC-40 has a related domain structure, with 6 Ig and 4 Fn domains and a 
similarly long cytoplasmic region (Keino-Masu et al., 1996; Chan et al., 1996; Kolodziej et 
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aL, 1996). The only currently known protein with a "5 + 3" organization is CDO (Kang et al., 
1997). However, CDO is only distantly related to Robo (15-33% a.a. identity between 
corresponding Ig and FN domains). 

We identified other "5 + 3" proteins in vertebrates whose amino acid identity exceeds 
that of CDO and represent Robo homologues. A human expressed sequence tag (EST; 
yu23dl 1, Accession #H77734) shows high homology to the second Ig domain of robo and 
was used to probe a human fetal brain cDNA library (Stratagene). The clones recovered 
correspond to a human gene with five Ig and three Fn domains (Figure 2). Exemplary 
functional Robo domains are listed in Tables 13-17 (the corresponding encoding nucleic acids 
are readily discernable from the corresponding nucleic acid sequences of Sequence Listing). 


Table 13. Exemplary domains of human Robo 1, by amino acid sequence positions 

Signal sequence: 

6-21 

First Immunoglobulin domain: 

68-167 

Second Immunoglobulin domain: 

168-258 

Third Immunoglobulin domain: 

259-350 

Fourth Immunoglobulin domain: 

351-450 

Fifth Immunoglobulin domain: 

451-546 

First Fibronectin domain: 

547-644 

Second Fibronectin domain: 

645-761 

Third Fibronectin domain: 

762-862 

Transmembrane domain: 

896-917 

Cytoplasmic motif #1 : 

1070-1079 

Cytoplasmic motif #2: 

1181-1195 

Cytoplasmic motif #3: 

1481-1488 


Table 14. Exemplary domains of human Robo II, by amino acid sequenc^positions 
Fourth Immunoglobulin domain: 1-91 
Fifth Immunoglobulin domain: 92-1 85 

First Fibronectin domain: 1 86-282 

Table 15. Exemplary domains of drosophila Robo 1, by amino acid sequence positions 
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Signal sequence: 

30-46 

First Immunoglobulin domain: 

56-152 

Second Immunoglobulin domain: 

153-251 

Third Immunoglobulin domain: 

252-344 

Fourth Immunoglobulin domain: 

345-440 

Fifth Immunoglobulin domain: 

441-535 

First Fibronectin domain: 

536-635 

Second Fibronectin domain: 

636-753 

Third Fibronectin domain: 

754-854 

Transmembrane domain: 

915-938 

Cytoplasmic motif #1 : 

1037-1046 

Cytoplasmic motif #2: 

1098-1119 

Cytoplasmic motif #3 : 

1262-1269 

Table 16. Exemplary domains of drosophila Robo II, by amino acid sequence positions 

Immunoglobulin domain #1: 

4-99 

Immunoglobulin domain #2: 

100-192 

Immunoglobulin domain #3 : 

193-296 

Immunoglobulin domain #4: 

297-396 

Immunoglobulin domain #5: 

397-494 

Fibronectin domain #1: 

495-595 

Fibronectin domain #2: 

596-770 

Fibronectin domain #3: 

771-877 

Transmembrane domain: 

906-929 

Conserved cytoplasmic motif #1 : 

1075-1084 

Table 17. Exemplary domains of C. elegans Robo 1, by amino acid sequence positions 

First Immunoglobulin domain: 

30-129 

Second Immunoglobulin domain: 

130-223 

Third Immunoglobulin domain: 

224-315 

Fourth Immunoglobulin domain: 

316-453 . 

Fifth Immunoglobulin domain: 

454-543 
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First Fibronectin domain: 


544-643 


Second Fibronectin domain 


644-766 


Third Fibronectin domain: 


767-865 


Transmembrane domain: 


900-922 


Cytoplasmic motif #1: 
Cytoplasmic motif #2: 
Cytoplasmic motif #3: 


1036-1045 


1153-1163 


1065-1074 


The homology is particularly high in the first two Ig domains (58% and 48% a. a. identity 
respectively, compared to 26% and 30% for the same two Ig domains between D-Robol and 
CDO) and together with the overall identity throughout the extracellular region and the 
presence of three conserved cytoplasmic motifs has led us to designate this as the human 
roundabout 1 gene {H-robol). Database searching reveals a nucleotide sequence 
corresponding to H-robol in the database, DUTT1, which differs in the signal sequence 
suggesting alternative splicing, a 9 bp insertion and seven single base pair changes. Five 
ESTs (see Experimental Procedures) show high sequence similarity to the cytoplasmic domain 
of H-robol. Sequencing of cDNAs isolated using one of these ESTs as a probe confirmed a 
second human roundabout gene (H-robo2). 

Degenerate PCR primers based on conserved sequences between H-robol and D- 
robol were used to isolate a PCR fragment from a rat embryonic El 3 brain cDNA library. 
The fragment was used to probe an El 3 spinal cord cDNA library, resulting in the isolation of 
a full length Rat robo gene {R-robol). The predicted protein shows high sequence identitiy 
(>95%) with H-robol over the entire length. The 5' sequences of different R-robol cDNA 
clones indicates that this gene is alternatively spliced in a similar fashion to H-robol IDUTTL 
We used a similar approach to isolate cDNA clones for R-robo2 9 which is highly homologous 
to H-robo2. 

The mouse EST vi92e02 is highly homologous to the cytoplasmic portionof ~H-fobo~l~ 
The C. elegans Sax-3 gene is also a robo homologue (Table 1; Zallen et al., 1997). A second 
Drosophila robo gene {D-robo2) is also predicted from analysis of genomic sequence in the 
public database. Taken together these data indicate that Robo is the founding member of a 
new subfamily of Ig superfamily proteins with at least one member in nematode, two in 
Drosophila, two in rat, and two in human. 
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The alignment of the Robo family proteins reveals that the first and second Ig domains 
are the most highly conserved portion of the extracellular domain. The cytoplasmic domains 
are highly divergent except for the presence of three highly conserved motifs (Table 18). 

Table 18. Conserved Cytoplasmic Motifs: Amino acid alignments of the three conserved 
cytoplasmic motifs are shown below the structure; in C.elegans robo, motifs #2 and #3 have 
been switched to provide a better alignment. 

Conserved Cytoplasmic Motif #1 

PDNPTPYATTMI IGTSS 1050 Drosophila roundabout - 1 (residues 1034-1050 
of SEQIDNO:2) 

SGQPTPYATTQLIQSNL 1083 Human roundabout - 1 (residues 1067-1083 of SEQ 


NAS PAPYATS S I LS PHQ 1088 Drosophila roundabout - II (residues 1072- 
1088 of SEQ ID NO:4) 

HDDPSPYATTTLVLSNQ 1049 C.elegans roundabout (residues 1033-1049 of 
SEQ ID NO:6) 


1070-1076 of SEQ ID NO:8) 

Conserved Cytoplasmic Motif #2 

INWSE . FLPPPPEHPPPSSTYG. Y 1119 Drosophila roundabout -I (residues 
1098-1 1 19 of SEQ ID NO:2) 

MNWAD . LLPPPPAHPPPHSNSEEY 12 02 Human roundabout -I (residues 1 180- 
1202 of SEQ ID NO:8) 

STWANVPLPPPPVQPLPGTELEHY 31 Human roundabout - 1 1 (residues 293- 
316 of SEQ ID NO: 10) ~~ 

KTLMD . FIPPPPSNPPPP . GGHVY 1168 C.elegans roundabout-I (residues 
1 147-1 168 of SEQ ID NO:6) 

nW. . .hhPPPP. PPP.S....Y Consensus (where h is 
-hydrophobic ) (residues 1148-1168 of SEQ ID NO:6) 


ID NO: 8) 


PtPYATT.hh. . . . 


Consensus (where h is I, L or V) (residues 
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Conserved Cytoplasmic Motif #3 

PSPMQPPPPVPVPEGW. Y 12 73 Drosophila roundabout -I (residues 1257- 
1273 of SEQIDNO:2) 

YTDDLPPPPVPPPAIKSP 1493 Human roundabout-I (residues 1476-1493 of 
SEQ ID NO:8) 

YADDLPPPPVPPPAIKSP 90 Mouse roundabout-I (residues 73-90 of SEQ ID 
NO: 12) 

RAPAMPTNPVPPEPPARY 1077 C.elegans roundabout (residues 1060-1077 of 
SEQ ID NO:6) 

PPPPVPPP .... Consensus (residues 78-85 of SEQ ID NO: 12) 

The consensus for the first motif is PtPYATTxhh, where x is any amino acid and h is I, L, or 
V. The presence of a tyrosine in the center of the motif indicates a site for phosphorylation. 
The other two motifs consist of runs of prolines separated by one or two amino acids and are 
reminiscent of binding sites for SH3 domains. In particular, the LPPP sequence in motif #2 
provides a good binding site for the Drosophila Enabled protein or its mammalian homologue 
Mena (Niebuhr et al., 1997). All three of these conserved sites can function as binding sites 
for domains (e.g. SH3 domains) of linker/adapter proteins functioning in Robo-mediated 
signal transduction. 

Robo is Regionally Expressed on Longitudinal Axons in the Drosophila Embryo. In 
order to determine the role that robo might play in regulating axon crossing behavior, we 
examined the robo expression pattern in the embryonic CNS. The in situ hybridization pattern 
of robo mRNA in Drosophila shows it to have elevated and widespread expression in the 
CNS. We raised a monoclonal antibody (MAb 13C9) against part of the extracellular portion 
(amino acids 404-725) of the protein to visualize Robo expression. Robo is first seen in the 
embryo weakly expressed in lateral stripes during germband extension. At the onset of 
germband retraction, Robo expression is observedliTthe neuroectodenTir"By the"end of stage— 
12, as the growth cones first extend, Robo is seen on growth cones which project ipsilaterally, 
including pCC, aCC, MP1, dMP2, and vMP2. Strikingly, little or no Robo expression is 
observed on commissural growth cones as they extend towards and across the midline. 
However, as these growth cones turn to project longitudinally, their level of Robo expression 
dramatically increases. Robo is expressed at high levels on all longitudinally-projecting 
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growth cones and axons. In contrast, Robo is expressed at nearly undetectable levels on 
commissural axons. This is striking since -90% of axons in the longitudinal tracts also have 
axon segments crossing in one of the commissures. Thus, Robo expression is regionally 
restricted. Robo expression is also seen at a low level throughout the epidermis and at a 
higher level at muscle attachment sites. In stage 16-17 embryos, faint Robo staining can be 
seen in the commissures but at levels much lower than observed in the longitudinal tracts. 

Immunoelectron Microscopy of Robo. We used immunoelectron microscopy to 
examine Robo localization at higher resolution. In stage 13 embryos, Robo is expressed at 
higher levels on growth cones and filopodia in the longitudinal tracts than on the longitudinal 
axons themselves. This localization is consistent with the model that Robo functions as a 
guidance receptor. The increased sensitivity of immunoelectron microscopy reveals the 
presence of very low levels of Robo protein on the surface of commissural axons. In addition, 
Robo-positive vesicles can be seen inside the commissural axons, possibly representing 
transport of Robo to the growth cone. Finally, by reconstructing the path of single axons by 
use of serial sections, we confirm that Robo expression is greatly up-regulated after individual 
axons turn from the commissure into a longitudinal tract. The expression of Robo on non- 
crossing and post-crossing axons and its higher level of expression on growth cones and its 
filopodia, provide a model where Robo functions as an axon guidance receptor for a repulsive 
midline cue. 

Transgenic Expression of Robo. We hypothesized that if Robo is indeed a growth 
cone receptor for a midline repellent, then pan-neural expression of Robo protein during the 
early stages of axon outgrowth might lead to a robo gain-of-function phenotype similar to the 
comm loss-of-function and opposite of the robo loss-of-function. To test this hypothesis, we 
cloned a robo cDNA containing the complete ORF but lacking most of its untranslated regions 
(UTRs) downstream of the UAS promoter in the pUAST vector and generated transgenic flies 
for use in the GAL4 system (Brand and Perrimon, 1993). Expression of robo in all neurons 
was achieved by crossing the UAS-robo flies to either theelav-G7tL~4 or scaBfous~GA'L4~\mQ%, 

Surprisingly, pan-neural expression of robo mRNA did not produce a strong axon 
scaffold phenotype as assayed with MAb BP 102. Staining with anti-Fas II (MAb 1D4) 
revealed subtle fasciculation defects, but overall the axon scaffold looked quite normal. An 
insight into why we failed to observe a stronger robo ectopic expression phenotype was 
provided by staining these embryos with the anti-Robo MAb. Interestingly, the Robo protein, 
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although expressed at higher levels than in wild type, remains restricted as in wild type, i.e., 
high levels of expression on the longitudinal portions of axons and very low levels on the 
commissures. This result indicates that there must be strong regulation of Robo expression, 
probably post-translational, that assures its localization to longitudinal axon segments. Such a 
mechanism could operate by the regulation of protein translation, transport, insertion, 
internalization and/or stability. 

We used these transgenic flies to rescue robo mutants. Expression of robo by the elav- 
GAL4 line in both robo 3 and robo 5 homozygotes rescued the midline crossing of Fas II 
positive axons including pCC and other identified neurons. 

Robo Appears to Function in a Cell Autonomous Fashion. To test whether Robo can 
function in a cell autonomous fashion, we used the UAS-robo transgene with the ftz ng -GAL4 
line (Lin et al., 1994) . The ftz ng -GAL4 line expresses in a subset of CNS neurons, including 
many of the earliest neurons to be affected by the robo mutation such as pCC, vMP2, dMP2, 
and MP1 . Expression of robo by the ftz ng -GAL4 line is sufficient to rescue these identified 
neurons in the robo mutant: pCC, which in robo mutants heads towards and crosses the 
midline, in these rescued embryos now projects ipsilaterally and does not cross the midline. 
When the same embryos were stained with the anti-robo MAb 13C9, we observed that all 
Robo-positive axons did not cross the midline. The ftz ng -GAL4 line drives expression in many 
of the axons in the pCC pathway (Lin et al., 1994), a medial longitudinal fascicle. In robo 
mutants, this axon fascicle freely crosses and circles the midline, joining with its contralateral 
pathway. When rescued by the ftz ng -GAL4 line driving XJAS-robo, this pathway now largely 
remains on its own side of the midline, even though occasionally a few axons cross the 
midline. These experiments support the notion that Robo can function in a cell autonomous 
fashion. 

Expression of Mammalian robol in the Rat Spinal Cord. The isolation of several 
vertebrate Robo homologues suggests that Robo may play a similar role in orchestrating 
midline crossing in the vertebrate nervous system as iTdoes"inT DrosopHila^Ifrthe Vertebrate 
spinal cord, the ventral midline is comprised of a unique group of cells called the floor plate 
(for review, Colamarino and Tessier-Lavigne, 1995). As in the Drosophila nervous system, 
the vertebrate spinal cord contains both crossing and non-crossing axons. Spinal commissural 
" neurons are bom in the dorsal half of the spinal cord; commissural axons project to and cross 
the floor plate before turning longitudinally in a rostral direction. In contrast, the axons of two 
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other classes of neurons, dorsal association neurons and ventral motor neurons, do not cross 
the floor plate (Altman and Bayer, 1984). 

To address the possibility that Robo may play a role in organizing the projections of 
these spinal neurons, we examined the expression of rat robol by RNA in situ hybridization. 
A rat robol riboprobe spanning the first three Ig domains was hybridized to transverse 
sections of El 3 rat spinal cord. At E13, when many commissural axons will have already 
extended across the floor plate (Altman and Bayer, 1984), rat robol is expressed at high levels 
in the dorsal spinal cord, in a pattern corresponding to the cell bodies of commissural neurons. 
Rat robol is also expressed at lower levels in a subpopulation of ventral cells in the region of 
the developing motor column. Interestingly, this expression pattern is similar to and overlaps 
partly with the mRNA encoding DCC, another Ig superfamily member which is also 
expressed on commissural and motor neurons and encodes a receptor for Netrin-1 (Keino- 
Masu et al, 1996). Rat robol is not, however, expressed in the either the floor plate or the roof 
plate of the spinal cord or in the dorsal root ganglia. This is in contrast to rat cdo, which is 
strongly expressed in the roof plate (KB, MT-L, and R. Krauss. In the periphery, rat robol is 
also found to be expressed in the the myotome and developing limb, in a pattern reminiscent 
of c-met (Ebens et al, 1996), indicating that rat robol may also be expressed by migrating 
muscle precursor cells. Therefore, like its Drosophila homologue, rat robol RNA is 
expressed by both crossing and non-crossing populations of axons, indicating that it encodes 
the functional equivalent of D-Robol. 

Genetic Stocks. All eight independent robo alleles were isolated on chromosomes 
deficient for Fasciclin III as described in Seeger et al., 1993. Subsequent use of a duplication 
that includes FasIII, and recombination of the robo chromosomes, indicates that the robo 
phenotype is independent of the absence of FasIIL Deficiencies were obtained from the 
Drosophila stock center at Bloomington, Indiana. 

Cloning and Molecular Analysis of the robo Genes. Start points for a molecular walk 
to robo were obtained from the Berkeley and Crete I>6sophila~^ 

Chromosomal walking was performed using standard techniques to isolate cosmids from the 
Tamkun library (Tamkun et al., 1992). cDNAs were isolated from the Zinn 9-12 hour 
Drosophila embryo gtl 1 library (Zinn et al., 1988), and from a human fetal brain library 
(Stratagene). Northern-blot of poly-A + RNA and reverse Northern blots were hybridized using 
sensitive Church conditions. 
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Sequencing of the cDNAs and genomic subclones was performed by the 
dideoxynucleotide chain termination method using Sequenase (USB) following the 
manufacturer's protocol and with the AutoRead kit or AutoCycle kit (Pharmacia) or by 33 P 
cycle sequencing. Reactions were analyzed on a Pharmacia LKB or ABI automated laser 
fluorescent DNA sequencers respectively. The cDNAs were sequenced completely on both 
strands. Sequence contigs were compiled using Lasergene, Intelligenetics, and 
AssemblyLIGN software (Kodak Eastman). Database searches were performed using BLAST 
(Altschuel et al., 1990). 

A full length D-robol cDNA was generated by ligating two partial cDNAs at an 
internal Hpal site and subcloning into the EcoRI site of pBluescript.SK+. A full length H- 
robol cDNA was synthesized by ligating an Xbal-Sall fragment from a cDNA and a PCR 
product coding for the carboxy-terminal 222 amino acids at a Sail site. The PCR product has 
an EcoRI site introduced at the stop codon. The ligation product was cloned into 
pBluescript.SK+ digested with Xbal and EcoRI. 

To clone the rat robol cDNA, degenerate oligonucleotide primers designed against 
sequences conserved between the 5 f ends of D-Robol and H-Robol were used to amplify a 
500 bp fragment from an El 3 rat brain cDNA by PCR. This fragment was used to screen an 
El 3 spinal cord library at high stringency, resulting in the isolation of a 4.2 kb cDNA clone 
comprising all but the last 700 nucleotides. Subsequent screenings of the library with non- 
overlapping probes from this cDNA led to the isolation of 4 partial and 7 full length clones. 
To clone the rat robo2 cDNA, we screened the same library with a fragment of the H-robo2 
cDNA. 

Expressed Sequence Tag and Genomic Sequences. The ESTs yu23dl 1 (#H77734), 
zr54gl2 (#AA236414) and yq76el2 (#H52936, #H52937) code for portions of H-Robol. The 
EST yq7el2 is aberrantly spliced to part of the human glycophorinB gene. Five ESTs 
yn50a07, yg02b06, ygl7b06, ynl3a04 and yml7gl 1 code for part of H-robo2. The 
Drosophila PI clone DS00329 encodes the genomic sequence of Z)-rafe"o27~Sequences 
1825710 and 182571 1 (both: #U88183; locus ZK377) code for the predicted sequence of C. 
elegans robo. The EST vi62e02 (#AA499193) codes for mouse robol. 

Identification of Molecular Defects In robo Alleles. Southern blots of robo alleles and 
their parental chromosomes were hybridized with fragments from the genomic cosmid clone 
106-1435 or partial cDNA clones to identify restriction fragment length polymorphisms 
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affecting the robo transcription unit. DNA was obtained from homozygous mutant embryos. 
35 cycles of the PCR was subsequently performed on the DNA obtained from half an embryo, 
t Primers specific for the region flanking the Cfol polymorphism used were : ROB06 (5- 
v\ GCATTGGGTCATCTGTAGAG -3') and ROB023 (5 ? -AGCTATCTGGAGGGAGGCAT-3'). 
The PCR products were purified on a Pharmacia H300 spin column and sequenced directly. 

Transformation of Drosophila, robo Rescue, and Overexpression. The 16 kb Xbal 
fragment from cosmid 106-1435 was cloned into the Drosophila transformation vector 
pCaSpeR3. Transformant lines were generated and mapped by standard procedures. Four 
independent lines were shown to rescue robo 1 3 5 alleles as judged by MAb 1D4 staining. 
< 0 PCR amplification of the D-robo ORF using the primers (5'- 

GAGTGGTGAATTCAACAGCACCAAAACCACAAAATGCATCCC-3 ? ) and (5 1 - 
W CGGGGAGTCTAGAACACTTCATCCTTAGGTG-3') produced a PCR product with an 
altered ribosome binding site that more closely matches the Drosophila consensus (Cavener, 
1987), and has only 21bp of 5 f UTR and no 3' UTR sequences. The PCR product was 
digested with EcoRI and Xbal and cloned into pBluescript (Stratagene) and subsequently, 
pUAST (Brand and Perrimon 1993). Transformant lines were crossed to elav-GAL4 and sca- 
GAL4 lines which express GAL4 in all neurons, or ftzng-GAL4 which expresses in a subset of 
CNS neurons (Lin et al, 1994). Embryos were assayed by staining with MAbs BP 102, 1D4 
and 13C9. For ectopic expression in the robo mutant background, the stocks robo 3 and robo 5 
(both protein nulls) were used. Crosses utilized the stocks w; robo/CyO; UAS-robo and w; 
robo/CyO; elav-GAL4. Due to the difficulty of maintaining a balanced stock, robo/+;ftz- 
ngGAL4/+ males were generated as required. 

Generation of Fusion Proteins and Antibodies. A six histidine tagged fusion protein 
was constructed by cloning amino acids 404-725 of the D-robo protein into the PstI site of the 
pQE31 vector (Qiagen). Fusion proteins were purified under denaturing conditions and 
subsequently dialyzed against PBS. Immunization of mice and MAb production followed 
standard protocols (Patel, 1994)1 ~~ ' 

RNA Localization and Protein Immunocytochemistry. Digoxigenin labeled antisense 
robo transcripts were generated from a subclone of a robo cDNA in Bluescript. In-situ tissue 
hybridization was performed as described in Tear et al., 1996. Immunocytochemistry was 
performed as described by Patel, 1994. MAb 1D4 was used at a dilution of 1:5 and BP 102 at 
1:10. For anti-robo staining, MAb 13C9 was diluted 1:10 in PBS with 0.1% Tween-20, and 
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the embryos were fixed and cracked so as to minimize exposure to methanol. The presence of 
triton and storage of embryos in methanol were both found to destroy the activity of MAb 
13C9. 

In situ hybridization of rat spinal cords was carried out essentially as described in Fan 
and Tessier-Lavigne, 1994. El 3 embryos were fixed in 4% paraformaldehyde, processed, 
embedded in OCT, and sectioned to 10 m. A l.Okb 35 S antisense rRobo riboprobe spannning 
the the first three immunoglobulin domains was used for hybridization. An additional non- 
overlapping probe was also used with identical results. DCC transcripts were detected as 
described in Keino-Masu et al., 1996. Immunohistochemistry against TAG-1 was carried out 
on 10 m transverse spinal cord sections using 4D7 monoclonal antibody (Dodd et al, 1988). 

Electron Microscopy. Canton S embryos were hand devitellinized, opened dorsally to 
remove the gut, and prepared for immunoelectron microscopy according to the procedures 
described previously (Lin et al., 1994), with the following modifications. The fixed embryos 
were incubated sequentially with MAb 13C9 (1:1) for 1-2 hours, biotinylated goat anti-mouse 
secondary antibody (1:250) for 1.5 hours, and then streptavidin-conjugated HRP (1:200) for 
1.5 hours. Hydrogen peroxide (0.01%) was used instead of glucose oxidase for the HRP -DAB 
reaction. 
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purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in the 
art in light of the teachings of this invention that certain changes and modifications may be 
made thereto without departing from the spirit or scope of the appended claims. 
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